Concordancing the Web with KWiCFinder
Abstract
The World Wide Web is a wondrous place, with an overwhelming variety of information in countless languages and domains. Just how many webpages there are and how they are distributed by language and content are not easy questions to answer. The Web is constantly growing and changing, and even the best estimates can only approximate its extent and composition. Studies of the nature of the Web echo the story of the blind men and the elephant: each extrapolates from different samples of an ever-evolving entity taken at different times and by divergent means. The most reliable estimates suggest that the number of publicly indexable webpages in mid-2001 falls in the range of three to five billion (i.e. thousand million = 10^9), a number projected to grow to 10-15 billion by mid-decade; others believe these figures too conservative and place the actual numbers at two to three times as many.
Similar resources
Exploring Parallel Concordancing in English and Chinese
This paper investigates the value of computer technology as a medium for the delivery of parallel texts in English and Chinese for language learning. An English-Chinese parallel corpus was created for use in parallel concordancing, a technique which has been developed to respond to the desire to study language in its natural contexts of use. Specific problems of dealing with Chinese characters ...
Effects of Web-based Concordancing Instruction on EFL Students’ Learning of Verb-Noun Collocations
This study investigates the influence of using five web-based practice units on English verb-noun collocations with the design of a web-based Chinese-English bilingual concordancer (keyword retrieval program) on collocation learning. Thirty-two college EFL students participated by taking a pre-test and two post-tests, and responding to a background questionnaire and an evaluation questionnaire....
Concordancing for parallel spoken language corpora
Concordancing is one of the oldest corpus analysis tools, especially for written corpora. In NLP, concordancing appears in the training of speech-recognition systems. Additionally, comparative studies of different languages result in parallel corpora. Concordancing for these corpora in an NLP context is a new approach. We propose to combine these fields of interest for a multi-purpose concordance for ...
TransSearch: A Free Translation Memory on the World Wide Web
A translation memory is an archive of existing translations, structured in such a way as to promote translation re-use. Under this broad definition, an interactive bilingual concordancing tool like the RALI’s TransSearch system certainly qualifies as a translation memory. This paper describes the Web-based version of TransSearch, which, for the last three years, has given Internet users access ...
Concordancing the Web
The World Wide Web is a wondrous place, with an overwhelming variety of information in countless languages and domains. Just how many webpages there are and how they are distributed by language and content are not easy questions to answer. The Web is constantly growing and changing, and even the best estimates can only approximate its extent and composition. Studies of the nature of the Web ech...